This is the first part of a series. Here are all 3 parts:
I’m reading “The Go Programming Language” by Brian Kernighan and Alan Donovan. It is a perfect programming language introduction, clearly written and perfectly structured, with nicely chosen examples. It contains no hand-waving – it’s aware of other languages and briefly acknowledges the choices made in the language design without lengthy discussion.
As an enthusiastic C++ developer, and a Java developer, I’m not a big fan of the overall language. It seems like an incremental improvement on C, and I’d rather use it than C, but I still yearn for the expressiveness of C++. I also suspect that Go cannot achieve the raw performance of C or C++ due to its safety features, though that maybe depends on compiler optimization. But it’s perfectly valid to knowingly choose safety over performance, particularly if you get more safety and more performance than with Java.
I would choose Go over C++ for a simple proof of concept program using concurrency and networking. Goroutines and channels, which I’ll mention in a later post, are convenient abstractions, and Go has standard API for HTTP requests. Concurrency is hard, and it’s particularly easy to choose safety over performance when writing network code.
Here are some of my superficial observations about the simpler features, which mostly seem like straightforward improvements on C. In part 2 I’ll mention the higher-level features and I’ll hopefully do a part 3 about concurrency. I strongly recommend that you read the book to understand these issues properly.
I welcome friendly corrections and clarifications. There are surely several mistakes here, hopefully none major.
No semicolons at the end of lines
Let’s start with the most superficial thing. Unlike C, C++, or Java, Go doesn’t need semicolons at the end of lines of code. So this is normal:
a = b
c = d
This is nicer for people learning their first programming language. It can take a while for those semicolons to become a natural habit.
No () parentheses with if and for
Here’s another superficial difference. Unlike C or Java, Go doesn’t put its conditions inside parentheses with if and for. That’s another small change that feels arbitrary and makes C coders feel less comfortable.
For instance, in Go we might write this:
for i := 0; i < 100; i++ {
...
}
if a == 2 {
...
}
Which in C would look like this:
for (int i = 0; i < 100; i++) {
...
}
if (a == 2) {
...
}
Type inference
Go has type inference, from literal values or from function return values, so you don’t need to restate types that the compiler should know about. This is a bit like C++’s auto keyword (since C++11). For instance:
var a = 1 // An int.
var b = 1.0 // A float64.
var c = getThing()
There’s also a := syntax that avoids the need for var, though I don’t see the need for both in the language:
a := 1 // An int.
b := 1.0 // A float64
d := getThing()
I love type inference via auto in modern C++, and find it really painful to use any language that doesn’t have this. Java feels increasingly verbose in comparison, but maybe Java will get there. I don’t see why C can’t have this. After all, they eventually allowed variables to be declared not just at the start of functions, so change is possible.
Types after names
Go has types after the variable/parameter/function names, which feels rather arbitrary, though I guess there are reasons, and personally I can adapt. So, in C you’d have
Foo foo = 2;
but in Go you’d have
var foo Foo = 2
Keeping a more C-like syntax would have eased C developers into the language. These are often not people who embrace even small changes in the language.
No implicit conversions
Go doesn’t have implicit conversions between types, such as int and uint, or floats and int. This also applies to comparison via == and !=.
So, these won’t compile:
var a int = -2
var b uint = a
var c int = b
var d float64 = 1.345
var e int = c
C compiler warnings can catch some of these, but a) People generally don’t turn on all these warnings, and they don’t turn on warnings as errors, and b) the warnings are not this strict.
Notice that Go has the type after the variable (or parameter, or function) name, not before.
Notice that, unlike Java, Go still has unsigned integers. Unlike C++’s standard library, Go uses signed integers for sizes and lengths. Hopefully C++ will get do that too one day.
No implicit conversions because of underlying types
Go doesn’t even allow implicit conversions between types that, in C, would just be typedefs. So, this won’t compile
type Meters int
type Feet int
var a Meters = 100
var b Feet = a
I think I’d like to see this as a warning in C and C++ compilers when using typedef.
However, you are allowed to implicitly assign a literal (untyped) value, which looks like the underlying type, to a typed variable, but you can’t assign from an actual typed variable of the underlying type:
type Meters int
var a Meters = 100 // No problem.
var i int = 100
var b Meters = i // Will not compile.
No enums
Go has no enums. You should instead use const values with the iota keyword. So, while C++ code might have this:
enum class Continent {
NORTH_AMERICA,
SOUTH_AMERICA,
EUROPE,
AFRICA,
...
};
Continent c = Continent::EUROPE;
Continent d = 2; // Will not compile
in Go, you’d have this:
type continent int
const (
CONTINENT_NORTH_AMERICA continent = iota
CONTINENT_SOUTH_AMERICA // Also a continent, with the next value via iota.
CONTINENT_EUROPE // Also a continent, with the next value via iota.
CONTINENT_AFRICA // Also a continent, with the next value via iota.
)
var c continent = CONTINENT_EUROPE
var d continent = 2 // But this works too.
Notice how, compared to C++ enums, particularly C++11 scoped enums, each value’s name must have an explicit prefix, and the compiler won’t stop you from assigning a literal number to a variable of the enum type. Also, the Go compiler doesn’t treat these as a group of associated values, so it can’t warn you, for instance, if you forget to mention one in a switch/case block.
Switch/Case: No fallthrough by default
In C and C++, you almost always need a break statement at the end of each case block. Otherwise, the code in the following case block will run too. This can be useful, particularly when you want the same code to run in response to multiple values, but it’s not the common case. In Go, you have to add an explicit fallthrough keyword to get this behaviour, so the code is more concise in the general case.
Switch/Case: Not just basic types
In Go, unlike in C and C++, you can switch on any comparable value, not just values known at compile time, such as ints, enums, or other constexpr values. So you can switch on strings, for instance:
switch str {
case "foo":
doFoo()
case "bar":
doBar()
}
This is convenient and I guess that it is still compiled to efficient machine code when it uses compile-time values. C++ seems to have resisted this convenience because it couldn’t always be as efficient as a standard switch/case, but I think that unnecessarily ties the switch/case syntax to its original meaning in C when people expected to be more aware of the mapping from C code to machine code.
Pointers, but no ->, and no pointer arithmetic
Go has normal types and pointer types, and uses * and & as in C and C++. For instance:
var a thing = getThing();
var p *thing = &a;
var b thing = *p; // Copy a by value, via the p pointer
As in C++, the new keyword returns a pointer to a new instance:
var a *thing = new(thing)
var a thing = new(thing) // Compilation error
This is like C++, but unlike Java, in which any non-fundamental types (not ints or booleans, for instance) are effectively used via a reference (it just looks like a value), which can confuse people at first by allowing inadvertent sharing.
Unlike C++, you can call a method on a value or a pointer using the same dot operator:
var a *Thing = new(Thing) // You wouldn't normally specify the type.
var b Thing = *a
a.foo();
b.foo();
I like this. After all, the compiler knows whether the type is a pointer or a value, so why should it bother me with complaints about a . where there should be a -> or vice-versa? However, along with type inference, this can slightly obscure whether your code is dealing with a pointer (maybe sharing the value with other code) or a value. I’d like to see this in C++, though it would be awkward with smart pointers.
You cannot do pointer arithmetic in Go. For instance, if you have an array, you can’t step through that array by repeatedly adding 1 to a pointer value and dereferencing it. You have to access the array elements by index, which I think involves bounds checking. This avoids some mistakes that can happen in C and C++ code, leading to security vulnerabilities when your code accesses unexpected parts of your application’s memory.
Go functions can take parameters by value or by pointer. This is like C++, but unlike Java, which always takes non-fundamental types by (non const) reference, though it can look to beginner programmers as if they are being copied by value. I’d rather have both options with the code showing clearly what is happening via the function signature, as in C++ or Go.
Like Java, Go has no notion of const pointers or const references. So if your function takes a parameter as a pointer, for efficiency, your compiler can’t stop you from changing the value that it points to. In Java, this is often done by creating an immutable type, and many Java types, such as String, are immutable, so you can’t change them even if you want to. But I prefer language support for constness as in C++, for pointer/reference parameters and for values initialized at runtime. Which leads us to const in Go.
References, sometimes
Go does seem to have references (roughly, pointers that look like values), but only for the built-in slice, map, and channel types. (See below about slices and maps.) So, for instance, this function can change its input slide parameter, and that change will be visible to the caller, even though the parameter is not declared as a pointer:
func doThing(someSlice []int) {
someSlice[2] = 3;
}
In C++, this would be more obviously a reference:
void doThing(Thing& someSlice) {
someSlice[2] = 3;
}
I’m not sure if this is a fundamental feature of the language or just something about how those types are implemented. It seems confusing for just some types to act differently, and I find the explanation a bit hand-wavy. Convenience is nice, but so is consistency.
const
Go’s const keyword is not like const in C (rarely useful) or C++, where it indicates that a variable’s value should not be changed after initialization. It is more like C++’s constexpr keyword (since C++11), which defines values at compile time. So it’s a bit like a replacement for macros via #define in C, but with type safety. For instance:
const pi = 3.14
Notice that we don’t specify a type for the const value, so the value can be used with various types depending on the syntax of the value, a bit like a C macro #define. But we can restrict it by specifying a type:
const pi float64 = 3.14
Unlike constexpr in C++, there is no concept of constexpr functions or types that can be evaluated at compile time, so you can’t do this:
const pi = calculate_pi()
and you can’t do this
type Point struct {
X int
Y int
}
const point = Point{1, 2}
though you can do this with a simple type whose underlying type can be const:
type Yards int
const length Yards = 100
Only for loops
All loops in Go are for loops – there are no while or do-while loops. This simplifies the language in one way compared, for instance, to C, C++, or Java, though there are now multiple forms of for loop.
For instance:
for i := 0; i < 100; i++ {
...
}
or, like a while loop in C:
for keepGoing {
...
}
And for loops have a range-based syntax for containers such as string, slices or maps, which I’ll mention later:
for i, c := range things {
...
}
C++ has range-based for loops too, since C++11, but I like that Go can (optionally) give you the index as well as the value. (It gives you the index, or the index and the value, letting you ignore the index with the _ variable name.)
A native (Unicode) String type
Go has a built-in string type, and built in comparison operators such as ==, !=, and < (as does Java). Like Java, Strings are immutable, so you can’t change them after you’ve created them, though you can create new Strings by concatenating other Strings with the built in operator +. For instance:
str1 := "foo"
str2 := str1 + "bar"
Go source code is always UTF-8 encoded and string literals may contain non-ASCII utf-8 code points. Go calls Unicode code points “runes”.
Although the built-in len() function returns the number of bytes, and the built in operator [] for strings operates on bytes, there is a utf8 package for dealing with strings as runes (Unicode code points). For instance:
str := "foo"
l := utf8.RuneCountInString(str)
And the range-based for loop deals in runes, not bytes:
str := "foo"
for _, r := range str {
fmt.Println("rune: %q", r)
}
C++ still has no standard equivalent.
Slices
Go’s slices are a bit like dynamically-allocated arrays in C, though they are really views of an underlying array, and two slices can be views into different parts of the same underlying array. They feel a bit like std::string_view from C++17, or GSL::span, but they can be resized easily, like std::vector in C++17 or ArrayList in Java.
We can declare a span like so, and append to it:
a := []int{5, 4, 3, 2, 1} // A slice
a = append(a, 0)
Arrays (whose size cannot change, unlike slices) have a very similar syntax:
a := [...]int{5, 4, 3, 2, 1} // An array.
b := [5]int{5, 4, 3, 2, 1} // Another array.
You must be careful to pass arrays to functions by pointer, or they will be (deep) copied by value.
Slices are not (deep) comparable, or copyable, unlike std::array or std::vector in C++, which feels rather inconvenient.
Slices don’t grow beyond their capacity (which can be more than their current length) when you append values. To do that you must manually create a new slice and copy the old slice’s elements into it. You can keep a pointer to an element in a slice (really to the element in the underlying array). So, as with maps (below), the lack of resizing is probably to remove any possibility of an pointer becoming invalid.
The built in append() function may allocate a bigger underlying array if it would need more than the existing capacity (which can be more than the current length). So you should always assign the result of append() like so:
a = append(a, 123)
I don’t think you can keep a pointer to an element in a slice. If you could, the garbage collection system would need to keep the previous underlying array around until you had stopped using that pointer.
Unlike C or C++ arrays, and unlike operator [] with std::vector, attempting to access an invalid index of a slice will result in a panic (effectively a crash) rather than just undefined behaviour. I prefer this, though I imagine that the bounds checking has some small performance cost.
Maps
Go has a built-in map type. This is roughly equivalent to C++’s std::map (balanced binary trees), or std::unordered_map (hash tables). Go maps are apparently hash tables but I don’t know if they are separate-chaining hash tables (like std::unordered_map) or open-addressing hash tables (like nothing in standard C++ yet, unfortunately).
Obviously, keys in hash tables have to be hashable and comparable. The book mentions comparability, but so few things are comparable that they would all be easily hashable too. Only basic types (int, float64, string, etc, but not slices) or structs made up only of basic types are comparable, so that’s all you can use as a key. You can get around this by using a basic type (such as an int or string) that is (or can be made into) a hash of your value. I prefer C++’s need for a std::hash<> specialization, though I wish it was easier to write one.
Unlike C++’, you can’t keep a pointer to an element in a map, so changing one part of the value means copying the whole value back into the map, presumably with another lookup. Go apparently does this to completely avoid the problem of invalid pointers when the map has to grow. C++ instead lets you take the risk, specifying when your pointer could become invalid.
Go maps are clearly a big advantage over C, where you otherwise have to use some third-party data structure or write your own, typically with very little type safety.
They look like this:
m := make(map[int]string)
m[3] = "three"
m[4] = "four"
Multiple return values
Functions in Go can have multiple return types, which I find more obvious then output parameters. For instance:
func getThings() (int, Foo) {
return 2, getFoo()
}
a, b := getThings()
This is a bit like returning tuples in modern C++, particularly with structured bindings in C++17:
std::tuple<int, Foo> get_things() {
return make_tuple(2, get_foo());
}
auto [i, f] = get_things();
Garbage Collection
Like Java, Go has automatic memory management, so you can trust that instances will not be released until you have finished using them, and you don’t need to explicitly release them. So you can happily do this, without worrying about releasing the instance later:
func getThing() *Thing {
a := new(Thing)
...
return a
}
b := getThing()
b.foo()
And you can even do this, not caring, and not easily even knowing, whether the instance was created on the stack or the heap:
func getThing() *Thing {
var a Thing
...
return &a
}
b := getThing()
b.foo()
I don’t know how Go avoids circular references or unwanted “leak” references, as Java or C++ would with weak references.
I wonder how, or if, Go avoids Java’s problem with intermittent slowdowns due to garbage collection. Go seems to be aimed at system-level code, so I guess it must do better somehow.
However, also like Java, and probably like all garbage collection, this is only useful for managing memory, not resources in general. The programmer is usually happy to have memory released some time after the code has finished using it, not necessarily immediately. But other resources, such as file descriptors and database connections, need to be released immediately. Some things, such as mutex locks, often need to be released at the end of an obvious scope. Destructors make this possible. For instance, in C++:
void Something::do_something() {
do_something_harmless();
{
std::lock_guard<std::mutex> lock(our_mutex);
change_some_shared_state();
}
do_something_else_harmless();
}
Go can’t do this, so it has defer() instead, letting you specify something to happen whenever a function ends. It’s a annoying that defer is associated with functions, not to scopes in general.
func something() {
doSomethingHarmless()
ourMutex.Lock()
defer ourMutex.Unlock()
changeSomeSharedState()
// The mutex has not been released yet when this remaining code runs,
// so you'd want to restrict the use of the resource (a mutex here) to
// another small function, and just call it in this function.
doSomethingElseHarmless()
}
This feels like an awkward hack, like Java’s try-with-resources.
I would prefer to see a language that somehow gives me all of scoped resource management (with destructors), reference-counting (like std::shared_ptr<>) and garbage collection, in a concise syntax, so I can have predictable, obvious, but reliable, resource releasing when necessary, and garbage collection when I don’t care.
Of course, I’m not pretending that memory management is easy in C++. When it’s difficult it can be very difficult. So I do understand the choice of garbage collection. I just expect a system level language to offer more.
Things I don’t like in Go
As well as the minor syntactic annoyances mentioned above, and the lack of simple generic resource (not just memory) management, I have a couple of other frustrations with the language.
(I’m not loving the support for object orientation either, but I’ll mention that in a later article when I’ve studied it more.)
No generics
Go’s focus on type safety, particularly for numeric types, makes the lack of generics surprising. I can remember how frustrating it was to use Java before generics, and this feels almost that awkward. Without generics I soon find myself having to choose between lack of type safety or repeatedly reimplementing code for each type, feeling like I’m fighting the language.
I understand that generics are difficult to implement, and they’d have to make a choice about how far to take them (probably further than Java, but not as far as C++), and I understand that Go would then be much more than a better C. But I think generics are inevitable once, like Go, you pursue static type safety.
Somehow go’s slice and map containers are generic, probably because they are built-in types.
Lack of standard containers
Go has no queue or stack in its standard library. In C++, I use std::queue and std::stack regularly. I think these would need generics. People can use go’s slice (a dynamically-allocated array) to achieve the same things, and you can wrap that up in your own type, but your type, can only contain specific types, so you’ll be reimplementing this for every type. Or your container can hold interface{} types (apparently a bit like a Java Object or a C++ void*), giving up (static) type safety.